arXiv:1504.01999v3 [math.PR] 20 Oct 2016 


RANDOM WALKS ON THE RANDOM GRAPH 


NATHANAEL BERESTYCKI, EYAL LUBETZKY, YUVAL PERES, AND ALLAN SLY 

Abstract. We study random walks on the giant component of the Erdos-Renyi 
random graph Q{n,p) where p = X/n for A > 1 fixed. The mixing time from a worst 
starting point was shown by Fountoulakis and Reed, and independently by Benjamin!, 
Kozma and Wormald, to have order log^ n. We prove that starting from a uniform 
vertex (equivalently, from a fixed vertex conditioned to belong to the giant) both 
accelerates mixing to O(logn) and concentrates it (the cutoff phenomenon occurs): 
the typical mixing is at (i/d)“^ logn ± (logn)^^^"’"®*-'^^, where u and d are the speed 
of random walk and dimension of harmonic measure on a Poisson(A)-Galton-Watson 
tree. Analogous results are given for graphs with prescribed degree sequences, where 
cutoff is shown both for the simple and for the non-backtracking random walk. 


1. Introduction 

The time it takes random walk to approach its stationary distribution on a graph is 
a gauge for an array of properties of the underlying geometry: it reflects the histogram 
of distances between vertices, both typical and extremal (radius and diameter); it is 
affected by local traps (e.g., escaping from a bad starting position) as well as by global 
bottlenecks (sparse cuts between large sets); and it is closely related to the Cheeger 
constant and expansion of the graph. In this work we study random walk on the giant 
component Ci of the classical Erdos-Renyi random graph Q{n,p), and build on recent 
advances in our understanding of its geometry on one hand, and random walks on trees 
and on random regular graphs on the other, to provide sharp results on mixing on Ci 
and on the related model of a random graph with a prescribed degree sequences. 

The Erdos-Renyi random graph 0{n,p) is the graph on n vertices where each of the 
( 2 ) possible edges appears independently with probability p. In their celebrated papers 
from the 1960’s, Erdos and Renyi discovered the so-called “double jump” in the size 
of Cl, the largest component in this graph: taking p = Xju with A fixed, at A < 1 
it is with high probability (w.h.p.) logarithmic in n; at A = 1 it is of order in 
expectation; and at A > 1 it is w.h.p. linear (a “giant component”). Of these facts, the 
critical behavior was fully established only much later by Bollobas [7] and Luczak [23] , 
and extends to the critical window p = (1 ±e)/n for e = as discovered in [7]. 

An important notion for the rate of convergence of a Markov chain to stationarity is 
its (worst-case) total-variation mixing time: for a transition kernel P on a state-space 
0 with a stationary distribution tt, recall \\p — u||tv = sup^(-fi[/r(A) — u(A)] and write 

= min{t : dTv(i) < e} where dTv(i) = max ||P*(x, •) — ttHtv ; 

X 

let tijix(e) and (t) be the analogs from a fixed (rather than worst-case) initial state x. 
The effect of the threshold parameter 0 < e < 1 is addressed by the cutoff phenomenon, 
a sharp transition from d^vit) ~ 1 to dTv(i) ~ 0, whereby tuixi^) = (1 + o(l))tMix(eO 
for any fixed 0 < e, e' < 1 (making tMix(e) asymptotically independent of this s). 
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Figure 1. Total-variation distance to stationarity from typical/worst 
initial vertices (marked red/blue) on the giant component ofG{n,p = ^). 


In recent years, the understanding of the geometry of Ci and techniques for Markov 
chain analysis became sufficiently developed to give the typical order of the mixing time 
of random walk, which transitions from n in the critical window ([26]) to log^ n at p = ^ 
on for A > 1 fixed ([5, 16]) through the interpolating order £~^ log^(e^n) when p = 
for <C e 1 ([H])- Of these facts, the lower bound of log^ n on the mixing 

time in the supercritical regime (A > 1) is easy to see, as Ci w.h.p. contains a path 
of clogn degree-2 vertices for a suitable fixed c > 0 (escaping this path when started 
at its midpoint would require (jlogn)^ steps in expectation). The two works that 
independently obtained a matching log^ n upper bound used very different approaches: 
Fountoulakis and Reed [16] relied on a powerful estimate from [15] on mixing in terms 
of the conductance profile (following Lovasz-Kannan [21]), while Benjamini, Kozma 
and Wormald [5] used a novel decomposition theorem of Ci as a “decorated expander.” 

As the lower bound was based on having the initial vertex vi be part of a rare bottle¬ 
neck (a path of length clogn), one may ask what is for a typical vi. Fountoulakis 
and Reed [16] conjectured that if vi is not part of a bottleneck, the mixing time is 
accelerated to O(logn). This is indeed the case for almost every initial vertex vi, and 
moreover then concentrates on cologn for cq independent of £ (cutoff occurs): 


Theorem 1. Let Ci denote the giant component of the random graph G{n,p = A/n) 
for A > 1 fixed, and let v and d denote the speed of random walk and the dimension of 
harmonic measure on a Foisson{X)-Galton-Watson tree, resp. For any 0 < e < 1 fixed, 
w.h.p. the random walk from a uniformly chosen vertex vi G Ci satisfies 


47x(e)-(i^d) Mogn 


<logi/2+o(i)^_ 


( 1 . 1 ) 


In particular, w.h.p. the random walk from vi has cutoff with a n window. 
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Figure 2. Mixing in total-variation vs. distance from the origin 
(rescaled to [0,1]). On left: random walk on a random regular graph 
mixes once its distance from the origin reaches the typical distance D. 

On right: the analog on a random non-regular graph, where mixing is 
further delayed due to the dimension drop of harmonic measure. 

Before we examine the roles of u and d in this result, it is helpful to place it in the 
context of the structure theorem for Ci, recently given in [11] (see Theorem 3.3 below): 
a contiguous model for Ci is given by (i) choosing a kernel uniformly over graphs on 
degrees i.i.d. Poisson truncated to be at least 3; (ii) subdividing every edges via i.i.d. 
geometric variables; and (iii) hanging i.i.d. Poisson Galton-Watson (GW) trees on every 
vertex. Observe that Steps (ii) and (iii) introduce i.i.d. delays with an exponential tail 
for the random walk; thus, starting the walk from a uniform vertex (rather than on a 
long path or a tall tree) would essentially eliminate all but the typical 0(l)-delays, and 
it should rapidly mix on the kernel (w.h.p. an expander) in time O(logn); see Fig. 1. 

It is well-known (see [8,13,18,28]) that D, the average distance between two vertices 
in Cl, is logi^n -|- Op(l), analogous to the fact that D = log^_in -|- Op(l) in G{n,d), 
the uniform d-regular graph on n vertices^ (both are locally-tree-like: G{n, d) resembles 
a d-regular tree while Q{n,p) resembles a Poisson(A)-GW tree). It is then natural to 
expect that coincides with the time it takes the walk to reach this typical distance 
from its origin ui, which would be v~^D for a random walk on a Poisson(A)-GW tree 

Supporting evidence for this on the random 3-regular graph G{n,3) was given in [6], 
where it was shown that the distance of the walk from vi after t = clogn steps is 
w.h.p. (1-|- o(l))(ut A F>), with u = 5 being the speed of random walk on a binary tree. 
Durrett [13, §6] conjectured that reaching the correct distance from vi indicates mixing, 
namely that tMixd) ~ 2v~^D for the lazy (hence the extra factor of 2) random walk 
on Q{n, 3). This was confirmed in [22], and indeed on Q{n, d) the simple random walk 
has tyuxi^) = logrf_i n-b 0(\/Iog~u), i.e., there is cutoff at v~^D with an 0{^ylogn) 
window (in particular, random walk has cutoff on almost every d-regular; prior to the 
work [22] cutoff was confirmed almost exclusively on graphs with unbounded degree). 

However, Theorem 1 shows that ^mix ~ (ud)~^logn vs. the v~^\ogy^n steps needed 
for the distance from vi to reach its typical value (and stabilize there; see Corollary 3.4). 
As it turns out, the “dimension drop” of harmonic measure (whereby d < logEZ unless 

^Here Op(l) denotes a random variable that is bounded in probability. 
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Figure 3. Harmonic measure on the first 7 generations of a GW-tree 
with offspring distribution F(Z = 1) = F(Z = 3) = ^. 


the offspring distribution Z is a constant), discovered in [24], plays a crucial role here, 
and stands behind this slowdown factor of (d/logA)“^ > 1. Indeed, while generation 
k of the GW-tree has size about (EZ)^, random walk at distance k from the root 
concentrates on an exponentially small subset of size about exp(d/c) (see Fig. 2 and 3). 
Hence, (ud)~^logn is certainly a lower bound on tMix (the factor translates time 
to the distance k from the root), and Theorem 1 shows this bound is tight on 0{n,p). 

The same phenomenon occurs more generally in a random walk on a random graph 
with a degree distribution {pk)^i, generated by first sampling the degree of each vertex 
V via an i.i.d. random variable with F{Dy = k) = pk conditioned on Dy being 
even, then choosing the graph uniformly over all graphs with these prescribed degrees. 

Theorem 2. Let G be a random graph with degree distribution {pk)^=i, such that for 
some fixed <5 > 0, the random variable Z given by E(Z = — 1) oc kpk satisfies 

F{Z > An) = o{l/n) for := exp [(logn)^/^"^] , (1.2) 

and let = (ud)~^ logn, where u and d are the speed of random walk and dimension 
of harmonic measure on a Galton-Watson tree with offspring distribution Z. 

(i) Set Wn = Vlog ri(log log n)^. If P 2 < 1 — (5 and 1 -|- <5 < EZ < K for an absolute 
constant K, then w.h.p. on the event that ui is in the largest component of G, 

d^f'^ (t* — Wn) > 1 — £ , whereas (t* -|- Wn) < £ • 

(a) Set Wn = \/logn. If Z > 2 and EZ < K for some absolute constants K, then for 
any £ > 0 there exists some 7 > 0 such that, with probability at least 1 — £ — o(l), 

drv ^ (t* - ^Wn) > 1 - £ , whereas d^f'^ (t* -|- 'ywn) < £ • 

Gondition (1.2) is weaker than requiring Z to have finite exponential moments. 
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The intuition behind these results is better seen for the non-backtracking (as opposed 
to simple) random walk (NBRW), which, upon arriving to a vertex v from some other 
vertex u, moves to a uniformly chosen neighbor w ^ u (formally, this is a Markov chain 
whose state-space is the set of directed edges in the graph). This walk has speed u = 1 
on a GW-tree (as it never backtracks towards the root), and on Q{n,3) it was shown 
in [22] to satisfy |tMix(£) — log 2 n| < C{e) for any fixed 0 < e < 1 w.h.p.—indeed, cutoff 
(with an O(l)-window) occurs once the distance from the origin reaches the average 
graph distance. If we instead take a random graph on 2n/3 vertices of degree 2 and 
n/3 vertices of degree 4, this corresponds to a GW-tree with an offspring distribution 
= 1) = F{Z = 3) = ^; since its k-th generation grows as 2^, the distance between 
two typical vertices is again asymptotically log 2 n. However, the probability that the 
NBRW follows a given path vi,V 2 ,---,Vk is WXjZi (with Zj denoting the number 
of children of Vi); setting d = ElogZ = log\/3 < log2, observe that ^j^^logZj 
concentrates around kd by GLT, and hence k ~ logn is a lower bound on mixing. 
(More generally, by Jensen’s inequality d = ElogZ < logEZ unless Z is constant.) 

This straightforward description of harmonic measure for the NBRW allows one to 
directly control the location of this walk in the random graph. Gonsequently, by adding 
a few ingredients to the approach originally used in [ 22 ], we were able to extend the 
NBRW analysis of that work to the non-regular setting (Theorems 4.1-4.2; similar 
results for the NBRW were independently obtained in [4]; see §4 for further details). 

However, the harmonic measure for the simple random walk (SRW) remains myste¬ 
rious, and there is no explicit formula for d even for very simple offspring distributions 
(see Fig. 3). Formally, let T be an infinite GW-tree rooted at p with offspring distribu¬ 
tion Z. Under our assumptions the random walk {Xt) is transient, so its loop-erasure 
defines a unique ray S,. Denoting graph distance by dist(-, •), let i/ be the asymptotic 
speed of the walk (well-defined for almost every tree; see [24, §3])) given by 

V lim j dist(W) p) ■ (1-3) 

t^oo ^ 

Let Ft denote the conditional probability given T, and set 

Wt{v) = — logET('i^ £ 0 • (1-4) 

Consider the metric d{^,r]) = exp(—A r]\) on all rays dT, where ^ A r/ is the longest 
common prefix of ^ and rj. It was shown in [24] that in the joint probability space of a 
GW-tree and SRW on it, 

WT^/t^d, (1.5) 

where d is non-random and is the Hausdorff dimension of harmonic measure in the 
above metric. Building on the results of [9,24,25], we establish the following refinement 
of (1.5), which plays a central role in our proof and seems to be of independent interest. 

Proposition 3. LetT be a Galton-Watson tree, eonditioned to survive, whose offspring 
variable Z satisfies 1 < EZ < oo. For every e > 0 there exists 6 > 0 so that, for all t, 

F {{WriCt) - dt\ > 5Vi) < e . ( 1 . 6 ) 

Indeed, the upper bound on fMix will hinge on showing that w.h.p. at a suitable time 
t = {udff^ log n+0{-\/\ogn) the L^-distance of SRW from equilibrium is exp(0(ydogJr)) 
— a term that originates from the 0(-v/t)-fluctuations of WT{ft) as per Proposition 3. 
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Organization and notation. In §2 we will establish Proposition 3 along with several 
other estimates for random walk on GW-trees, building on the works of [9,24,25]. 
Section 3 studies SRW on random graphs and contains the proofs of Theorem 1 and 2, 
while Section 4 is devoted to the analysis of the NBRW. 

Throughout the paper, a sequence of events An is said to hold with high probability 
(w.h.p.) if P(An) —)• 1 as n —)■ oo. We use the notation / <C 5 and / < 5 to abbreviate 
/ = o{g) and / = 0 ( 5 ), resp. (as well as their converse forms), and / x 5 to denote 
f g ^ f ■ Finally, in the context of an offspring distribution Z with EZ > 1, we say 
that r is a GW*-tree to refer to the corresponding GW-tree conditioned on survival. 


2. Random walk estimates on GAlton-Watson trees 


Let T be an infinite tree, rooted at some vertex /?, on which random walk is transient. 
In what follows, we will always use {Xt) to denote SRW on T and let the random variable 
^ G dT denote its limit, i.e., the unique ray that {Xt) visits i.o., or equivalently, the 
loop-erased trace of (W)^o- vertex u G T other than the root, we let v~ 

denote its parent in T, and let 9t{v) denote the incoming flow at v relative to its 
parent corresponding to harmonic measure: 

9t{v) = IP’r(T G ^ I G 0 ) (2-1) 

so that if (uo = /O, ui, • • ■, Ufc = v) is a shortest path in T then Er(T £ 0 = OiLi ^T{vi)- 
Our goal in this section is to establish Proposition 3 as well as the next two estimates, 
in each of which the underlying offspring distribution Z is assumed to have EZ > 1. 


Definition (Hitting measure). For a tree T rooted at p and an integer R, let Tr denote 
the tree induced on {u : dist(/9, u) < R}. For v G Tr, let Otji{v) denote the probability 
that SRW from p first hits level R of T at a descendent of v. 


Lemma 2.1. Let T he a GW*-tree rooted at p. There exists some c > 0 (depending 
only on the law of Z) so that for every R > 0 the following holds. With GW* -probability 
at least 1 — exp(—ci?), every child v of p satisfies 


6t{v) - Orniv) 


< exp(—cR), 


( 2 . 2 ) 


log6»r(u) - \ogeTji{v) 


< exp(—cR). 


(2.3) 


Lemma 2.2. Let T be a GW*-tree. There exists c > 0 such that, for all R,t > 0, 

E(dist(W,0 > R) ^ exp(—cR). 


2.1. Proof of Lemma 2.1. Let Zi be the degree of p, and denote the children of p as 
vi,V 2 ,... ,vzi. We will show that (2.2)-(2.3) hold for v = vi except with probability 
exp(—cR), and the desired statement will follow from a union over the children of p and 
averaging over Zi (multiplying said probability by the expectation of Zi w.r.t. GW*, 
which is 0(1)). 

Let Li be the set of leaves of the depth-(R — 1) subtree of Vi (i.e., descendants of Vi 
at distance (R — 1) from it) and let r = min{t ; Xt G UL*} be the hitting time of SRW 
to level R of T, so that 6th{v) = ^riXr G Li). Now let B = Ufc>o {^r+k = p} denote 
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the event that Xt revisits p = v after time r. Clearly, on the event we have = v 
iff Xr € Li, and so 


Oriv) - 0Tniv) 


< Ft{B) . 


(2.4) 


Estimating Ft{B) follows from the following result in [9, p21], which was obtained as 
a corollary of a powerful lemma of Grimmett and Kesten [17] (cf. [9, Lemma 2.2]). 


Lemma 2.3. There exist c,a > 0 such that the following holds. Let T be a GW*-tree 
and let R> 1. With probability at least 1 — ex.p{—cR), every depth-R vertex x satisfies 
that the ray P from the root to x has at least aR vertices v such that Pi,(rp = oo) > c, 
where Tp denotes the return time of RW to the ray P. 


By this lemma, there is a measurable set A of GW*-trees such that P(^) < exp(—ci?) 
and for every T ^ A, every z G UL, satisfies the following: there are at least aR vertices 
on its path to p so that the SRW from such a vertex y has a probability g > 0 of escaping 
to oo never again visiting its parent y~. In particular, 

^t{B)1t^A < (1 - < exp(-c'ii) 

for some c' > 0, which we take to be at most c/2. This establishes (2.2). 

It remains to show (2.3). Let jt be the probability that SRW on T does not revisit 
the root beyond time 0 , and notice that by definition, for every i, 

Oxivi) A Oxjiivi) > -nlZi , 


where 7 * is the copy of yr corresponding to the subtree rooted at u*. By [24, Lemma 9.1], 

1 


E[1/7] < 


= 0 ( 1 ). 


1 -E[l/2'] 

Define the event 

^ = >4 U {71 < U [Zi > , 

and observe that P(^) = 0 (e“ 4 ^^) by Markov’s inequality. 

On the event A'^, 


log 6 »T('y) - log OTjiiv] 
establishing (2.3). 


< 


\^t{v) - 9Tiiiv)\ ^Ft{B) ^ e _ „-ic' 


< 


9t{v) /\6tj^{v) 1\IZ\ g 4 ^^'^ 


= e 4 


R 


2.2. Proof of Lemma 2.2. Following [24], consider the Augmented GW-tree (AGW), 
which is obtained by joining the roots of two i.i.d. GW-trees by an edge. A highly useful 
observation of [24] is that the process in which SRW acts on (T, p') by moving the root 
to one of its neighbors is stationary when started at an AGW-tree T and one of its 
roots. Glearly, the probability of the event under consideration here in the GW-tree is, 
up to constant factors from below and above, the same as the analogous probability in 
the AGW-tree (with positive probability the walk never traverses the edge joining the 
two copies; conversely, a union bound can be applied to the two GW-tree instances). 
For brevity, an AGW*-tree will denote an AGW-tree conditioned to be infinite, for 
which the above stationarity property w.r.t. SRW is still maintained. 
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The event dist(Xi,^) > R implies that, if u is the vertex at distance R from Xt on 
the loop-erased trace of {Xq,Xi^ ... ,Xt) (the result of progressively erasing cycles as 
they appear) then the SRW must revisit u at some time s > t. 

The stationarity reduces our goal to showing that = u}) < exp(—ci?), 

where u is the vertex at distance R from Xq on the loop-erased trace of (Xq, ..., X_i). 
Condition on the past of the walk. By Lemma 2.3, with probability 1 — 0(exp(—ci?)) 
the AGW*-tree T satisfies that the path from p to u contains some aR vertices from 
which the walk would escape and never return to this path with probability bounded 
away from 0 , whence the probability of visiting u at a positive time is at most exp(—ci?) 
for some c > 0 . ■ 

2.3. Proof of Proposition 3. A regeneration point for the SRW on the GW*-tree is 
a time r > 1 such that the edge (Xt-i, Xt) is traversed exactly once along the trace of 
the random walk. Set tq = 0 and let ri < T 2 < ... denote the sequence of regeneration 
points on a random GW*-tree T. The following remarkable result due to Kesten was 
reproduced in [27] (see also [24,25] where the first part was observed). 

Lemma 2.4. Let T be a GW*-tree rooted at p, let (rj)j>o he the regeneration points of 
SRW on it, and let pi = dist(/ 9 , denote their depths in the tree. 

(a) Let Ti (i > 1) denote the depth-{pi — Pi-i) tree that is rooted at Xt-._;^. Then 
{(Tj, {Xt)T,_^<t<Ti)}i>i are mutually independent, and for i > 2 they are i.i.d. 

(b) There exist some a > 0 such that Eexp[Q;((/9j — Pi-i)] < oo. 

By the standard decomposition of a GW*-tree into a GW-tree with no leaves (the 
skeleton of the tree) and critical bushes (see, e.g., [ 2 , § 1 . 12 ]), together with the fact 
that the harmonic measure is supported on the skeleton, we may assume throughout 
this proof that P(Z = 0 ) = 0 . 

Let Pi = dist(/ 9 , Xt-J {i > 1) as in the above lemma (noting that necessarily X^-- = 
as regeneration points are by definition part of the loop-erased trace f) and set 

Yi = -\ogeT{f.i), 

so that our goal is to show the concentration of Yl\=i = fk’T( 6 )- Next we show that 
Yi has exponential moments. If ui,..., vzi are the children of the root (recall Z\ > 0) 
then = E*=i Ui=v. .6rp^{vi) with the summands being i.i.d. given Zi. Hence, 

E [0^1(6) I ^i] = ^lE I Zi] = ZiE [U=v, I T] I Zi] = Zi, 

and it follows that 

Eexp(yi) = E [0 t(6)"^] = = 0(1) • 

By [24, Theorem 8.1] there exists a probability measure GWharm on GW-trees which 
satishes the following. It is stationary for the harmonic flow 6t, mutually absolutely 
continuous w.r.t. GW (moreover, GWharm/GW is uniformly bounded from above), and 
under GWharm one has E[lLT(^t)] = We will show that 

Var(lTr(6)) < C,t w.r.t. GWharm xSRW (2.5) 

for some constant Cz that depends only on the law of the offspring variable Z, where 
GWharm x SRW is the joint distribution of a GWnARM-tree and SRW on it. A direct 
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application of Chebyshev’s inequality will then imply (1.6) under GWharmxSRW (for 
<5 = \/ Czle). The analogous statement for GWkSRW will then follow by the absolute 
continuity: for every e > 0 there exists e' such that if the event in (1.6) has probability 
at most e' under GWharmxSRW then it has probability at most e under GWkSRW. 

It remains to show (2.5). Note that the bounds established above on under GW 
remain valid under GWharm thanks to the fact that (iGWHARM/dGW= 0(1). 

Since the sequence {Yi)i>i is stationary under GWharm xSRW, it suffices to show that 
there exist some constants c, c' > 0 such that Gov(li,lfc) < cexp(—c'fe) for all k. Let 
Tik/ 2 \ be the depth-[A:/2j subtree of T (sharing the same root), and write 6 {v) = Oxiv) 
and 9{v) = (^) brevity. Denoting the children of the root by ui,..., vz^, recall 

from Lemma 2.1 (with R = [A:/2j) and Lemma 2.4 that there exist some ci,C 2 > 0 
(depending only on the offspring distribution Z) such that the events 61,82 given by 

£ 1 = (j {\Hvi) - G{vi)\ > , T 2 = > k/ 2 ] 

l<i<Zi 


satisfy P(Ti UT 2 ) < exp(—C 2 A:). It follows from Holder’s inequality that 
Gov (Hi le„Yk) < [Ey2] i < exp(-C 2 fc) 

(recall L) has finite moments of every order), and similarly for Gov (lil^-c, l^l^-j); 
hence, to estimate Cov(yi,yfc) it remains to bound Cov(lil£-c, Y), l£|)- 
On the event £{, either (i) we have 6 {vi) < exp(—|ciA:), whence 

e{vi) log^ 9{vi) < and 9{vi) log^ 9{vi) < , 


or (ii) we have 9{vi) > exp(—fci/c) and infer from the bound on 9{vi) — 9{v, 


that 


log9{vi) - log9{vi) 


< 


o-cik 


9{vi) A 9{vi) 




where in both cases the implicit constants are independent of k. Hence, if we let 
assume the value log(l/ 0 (uj)) with probability 9{vi) for each child Vi of the root of T, 


E|yil^c-y/|"<e-5'’i^EZ<e 


.-jClfc 


thus (recalling that EY^ = 0 ( 1 )), 

Cov{Yil£c - Yl, Ykls^) < ^E{Yils<^-Y{fEYi < . 

Thanks to the discrete renewal theorem—crucially using that the renewal intervals ipi 
have exponential moments (see [19] as well as [20, §11.4])—we can couple Tfclfi with an 
i.i.d. copy of Yi with probability 1 —0(exp(—cs/c)), and so Cov(y/, < exp(—C 3 A:). 

This completes the desired bound on Cov(yi, Y^) and concludes the proof. ■ 
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3. Random walk from a typical vertex in a random graph 


We begin by studying SRW and LERW on an infinite GW-tree along what would 
be the cutoff window. This analysis will then carry over to SRW on G via coupling 
arguments, and provide initial control over the distribution of the walk within the cutoff 
window. The hnal step will be to boost mixing in that window via the graph spectrum. 


3.1. Quantitative estimates on the infinite tree. Fix e > 0 and set 


m = n exp 


-7\/log 


n 


4 = ^logm = - log n - O(v^logn), 
ii=io + 

where 7 > 0 is a suitably large constant with the following two properties: 
(i) By Proposition 3, for large enough 7 (in terms of the law of Z and e), 


(wri^eo) > logm - l'y^ylogn^ 
{WriCii) < logm + I'y^/logn 


> 1 — e . 

> I — e . 


(3.1) 

(3.2) 

(3.3) 

(3.4) 

(3.5) 


(ii) It is known (see [27]) that the distance of SRW (W) from the root (its origin) of 
GW-a.e. tree has mean vt + 0(1) and variance at most Czt for some fixed Cz > 0 
(which depends only on the offspring variable Z). Thus, 

P ^|dist(p, W) — < Y|g 7 \/log> 1 — e for any t < 

provided 7 is large enough in terms of d, Cz and e. Gombined with the exponential 
tails of the regeneration times (recall §2.3), taking 7 large enough further gives 


P ( max |dist(p, W) - < j^ 7 \/logn ] 

\t<v-^l\ J 

Let Co > 0 be some constant (to be specified later), and set 

R = Co log log n . 


> 1 — e . 


(3.6) 

(3.7) 


Definition. For a tree T and integer R > 0, if uq = p,ui,... ,Uk = u is the shortest 
path from the root to a vertex u and 9 is as defined in ^2, define 

k 

Wriu) = > 

i=l 

where Tii{uf) is the depth-R subtree of T rooted at the parent ofui. 

Gonsider the exploration process where at step k — corresponding to in the 
filtration — we expose the depth-(/c + 1 ) descendants of the root, as in the standard 
breadth-first-search process, with one modification: we explore the children of a vertex 
V (which is at depth k) only if its distance-R ancestor, denoted u, satisfies 


Wt{u) < log m -|- ^ 7 Y^logre . 


(3.8) 
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Instead, if (3.8) is violated, we declare that the exploration process is truncated at all 
the depth-i? descendants of u, and denote this J^fc-measurable event by Btr{u). 

We stress that, when determining whether to truncate the exploration at the vertex 
V as described above, the variable Wt{u) is Jfc-measurable (as the event Btriui) has 
not occurred for any of the ancestors of u, thus the subtree of depth R of each of them 
has been fully explored). 

Lemma 3.1. Let T be a GW*-tree, and define i\, 7 and Btr as above. If cq from (3.7) 
is sufficiently large, then 

!?( U Btfiik]) < IP (Wr(^£i) > \ogm+ ^yy^log+o(l) < e + o{l). 

k<£i 

To prove this result, we need the following simple claim. 

Claim 3.2. Let T be a GW*-tree. There exist ci,C 2 > 0 such that the following holds. 
For every measurable set A of trees and integer I, 

-|- exp . 

i<l 

Proof. It suffices to prove this for the AGW*-tree (as argued in the proof of Lemma 2 . 2 ) . 
Let (W)^o simple random walk on T whose loop-erased trace is Then 

<p( U {l?(>o(^t) e^})+p( U {dist(Wi,p) <£}) 

i<£ t<2llv t>2llv 

< ‘^¥{A) + exp , 

t>2tlv 

using the stationarity of the AGW*-tree and the large deviation estimate from [27] on 
the speed of random walk on a GW-tree (see also [9]). ■ 


Proof of Lemma 3.1. Note that the right-hand inequality follows from the choice of 
7 as per (3.5), and it remains to prove the left-hand inequality. 

Let A denote the set of trees T for which there exists a child v of the root such that 


logOTj^iv) - logB t{v) 


> exp(—ci?) 


for c > 0 the constant from Lemma 2.1. Note that P(.4,) < exp(—ci7) by (2.3) in that 
lemma. Furthermore, on the event 


k-l 


£k := n ^ A} )n Btriik) 


i=0 


we have 

logOT^iii) - logOTiii) < exp(-ci?) for alH = 1,..., A:. 
Thus, by the triangle inequality, the event £k implies that 

Wt( 4) - Wr(4) < ke~^^ < e~''^\ogn = ©(log-^n) , 
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provided that k = O(logn) and cq > 3/c in the definition (3.7) of R. Therefore, by the 
truncation criterion (3.8), we find that £k implies that 

bhT(^fc) > logm + ^ 7 -y/logn — 0 (log“^ n) > logm + ^yy/logn 

for sufficiently large n, whence (using that WT{Ck) < l^r(^£i) for all k < ii), 

[J £k C |Wt( 6 i) > logm + 57 v/logn| . (3.9) 

k<ii 

On the other hand. Claim 3.2 shows that 

p( U ci£ie-=« + = 0 ( log -2 n) . (3.10) 

i<ll 

Since, by the definition of £k, 

U ^tr(4) c ( U u ( U {Too(^*) e ^} ) , 

k<ii k<t\ k<i\ 

combining (3.9)-(3.10) completes the proof. ■ 

Let Sk {k < li) denote the set of explored vertices at distance k from the root (so 
that Sk is Tfc-measurable), and define the T^+ij-measurable by 

S'k = {u&Sk : . 

By the definition of the truncation criterion, for every u € 

—Wt{u) > — logm — 27 \/logn . 

It is easy to see (e.g., by induction on k) that 

exp(— 1 Tt(u)) < 1 . 

ues’. 

In particular, 

thus it follows from (3.1) that 

\Sk\ < mexp ^^ 7 Y^log= nexp ^ 7 \/log ■ 

Therefore, as the maximum degree is at most A = exp[(log n)^/2-'5j^ number of 
vertices reached in the first ii exploration rounds (i.e., those revealed in T^^) is at most 

l-S’fcl < ISfclA^ < .^inexp < nexp (^-57\/fog , (3-11) 

k<ii k<ii 

using the fact A-^ = exp[0((logn)^/2-^ Joglogn)] = exp[o(-v/fog™)]- (We stress that 
the estimate (3.11) holds with probability 1). 

Finally, recall from (3.4) that 1 Tt(^£o) > logm — ^^yiogn with probability at least 
1 — e. Since Lemma 3.1 implies that IP(C 4 ^ > 1 — e — o(l), it therefore follows 

(once we recall from (3.1) that ^ exp (gyVlog n) = ^ exp (ly-y/logn)) that 

F{^,^GSy)>l-2e-o{l) 
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for C defined by 

5’"o = {u^ : Pt(^£o = «) < ^ exp (|7\/log n) } . (3.12) 

3.2. Coupling the GW-tree to the random graph. We describe a coupling of 
SRW (Xt) on a GW-tree^ T, started at its root p, to SRW (^t) on G, started at a 
fixed vertex vi. Let T C T be the truncated GW-tree from §3.1, and let r be the time 
at which the SRW {Xt) on T reaches level f'l. We will first construct a coupling of G 
and T, in which we will specify a tree F C G rooted at vi and a one-to-one mapping 
0 : f —)• T, such that 

degQ(x) = (iegrp{(f){x)) for all x G f , (3.13) 

(where degQ{u) accounts for multiple edges). Moreover, defining, for k > 1, the event 

Hfc := f|{WG</>(f)}, 

t<k 

we will show that 

P(n^) > 1-e-o(l). (3.14) 

In what follows, we refer to the event 11^ as a “successful coupling” of Xt)^^i, for 
the following reason: denoting f = min{t : Xt ^ </>(r)}, take for t < f, 

let choose a uniform neighbor of ^f-i among those that belong to G \ F (recalling 
that degQ(.^f-i) = deg 2 ’(Xf_i)) and run ^t independently of Xt for t > f. This 
yields the correct marginal of SRW on G by (3.13). When using this coupling, on the 
event 11,- one has ^t = for all t < t. 

Gonstruct the coupling of {G, T) by exposing, in tandem, the truncated GW-tree 
T and a subgraph of the exploration process of vi in G via the configuration model. ^ 
Initially, we let degQ(ui) = degj~{p) and set F = {ui} and </>(ui) = p. We proceed by 
exposing, vertex by vertex, the tree T, and attempting to find F in G such that F is 
isomorphic to a subtree of T rooted at p, while maintaining that the queue of active 
vertices (ones that await exploration) in G will always be a subset of F. 

Suppose we are now exploring the offspring of 2: G 0(F). Note one can map the 
offspring yi,... ,yr of z one-to-one to the half-edges of u = 0“^(z) in G, a currently 
active vertex in the exploration queue, with (3.13) in mind. 

(a) If the offspring of z are to be truncated, remove u from the exploration queue of G 
(do not explore its half-edges). Otherwise, process yi,... ,yr sequentially as follows. 

(b) If the half-edge corresponding to y, is matched to some previously encountered 
vertex v in the exploration of G (i.e., the edge {u,v) formed by this pairing closes 
a cycle in G). In this case, mark both u and v by saying that the events Bcydu) 

^More precisely, the degree of the root will be different—given by {pk) as opposed to the size-biased 
law of Z —but this has no effect since this model is clearly contiguous to the standard GW-tree. 

Q 

Every vertex u is associated with a number of “half-edges” corresponding to its degree, and the 
(multi)graph is generated by a uniform perfect matching of the half-edges (see, e.g., [8]). In our context, 
we reveal the connected component of vi by successively revealing the paired half-edges of vertices in a 
queue (breadth-first-search). If multiple edges or loops are found, or the component is exhausted with 
at most y/n vertices, we abort the analysis (rejection-sampling for having vi € Ci(G)). 
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and Bcyc{v) hold, and remove u and its offspring from the exploration queue of G. 
In addition, we delete u and its offspring from F. 

(c) If, on the other hand, the half-edge corresponding to yi is to be matched to a 
vertex v that has not yet been encountered, we sample degrp{yi) and deg(^(i;) via 
the maximal coupling between their distributions (the former has the law of Z, and 
the law of the latter is a function of the remaining vertex degrees in G). If this 
sample has deg^(?/j) = degg(u), then we add u to F and set = y*; otherwise, 
we say 13deg{v) holds, and remove v from the exploration queue. 

The above construction satisfies (3.13) by definition. To verify (3.14), we must estimate 
the probability that one of the following events occurs: (a) the walk (Xt) visits a 
truncated vertex (i.e., Btr{Xt) occurs for some t < r), (b) its counterpart {^t) visits a 
vertex with non-tree-edges (i.e., Bcyd^t) occurs for some t < r), or (c) the walk (Xt) 
visits some vertex y £ T such that its counterpart in G had a mismatched degree (that 
is, Bdegi^t) occurs for some t < t). 

The estimate of the event in (a) will follow directly from our results in §3.1. For 
the sake of estimating the events in (b)-(c), and in order to circumvent potentially 
problematic dependencies between the random walk and the random environment, we 
will estimate the following larger events: if denotes the distance-i? ancestor of a 
vertex tc in a tree, then let 

B{w) = IJ |Scyc(?^) UBdegiv) : V G fR+i(u;^)| , 

that is, we test whether there exists a vertex with non-tree-edges or a mismatched degree 
in the entire depth-i? subtree of the distance-(i? — 1) ancestor of w. We will estimate 
the probability oi B = IJiio where q is the first time that dist(,^, ui) = R + i- 

We first reveal the first R + 1 levels of F and T, and examine whether Bcydv) or 
Bdeg{v) hold for any one of their vertices. (If so, then B occurs due to i = 0.) For each 
z > 1, we reveal level R + i + 1 of F and T in the following manner: we first run until 
Q, then examine whether Bcydv) or Bdegiv) hold for some v that is a depth-(i? -|- 1) 
descendent of (if so, 13 occurs) and proceed to reveal the rest of this level. 

FTsing this analysis, we guarantee that we examine whether Bcydv) U Bdeg{v) occur 
for vertices u in a given level, only at the first time that the walk has reached that 
level (thus preserving the underlying independence in the configuration model). In 
addition, if leaves F for the first time on account of either Bcyd^t) or Bdegi^t)-, 
then either B holds, or IXt must have revisited some vertex u after reaching one of its 
depth-i? descendants. The latter occurs by time t with probability at most texp(—ci?) 
by Lemma 2.2, which is o(l) for t x log n provided that the constant cq in the definition 
of R is sufficiently large. 

(a) Avoiding a truncation: By definition, a truncated vertex v corresponds to a distance- 
R ancestor u for which the event Btr{u) holds, and by Lemma 3.1 the probability 
that the LERW ^ visits such a vertex u by time ii is at most e -|- o(l). 

In the event that the LERW ^ does not visit such a vertex, the SRW (Xt) must 
escape from the ray ^ to distance at least R in order to hit a truncated vertex, a 
scenario which has probability at most exp(—cR) by Lemma 2.2; thus, the overall 
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probability of the SRW hitting a truncation by some time t x log n is at most 

< e“'^^logn = o(l) 

provided the constant cq in the definition of R is chosen large enough. 

(b) Avoiding a cycle: A set S of size s vertices in the boundary of the exploration 
process of G has at most As half-edges that are to be matched in the next round. 
A non-tree-edge at a given vertex u G S' is generated by either 

1. matching a pair of its half-edges together — which has probability 0(A^/n) as 
long as the number of remaining unmatched half-edges has order n, or 

2. pairing one of them to another unmatched half-edge of S (the more common 
scenario inducing cycles) which has probability 0(sA^/n). 

By (3.11) we know that s < n exp(—^y-^/log n) = o{n/A) and hence, indeed, the 
number of unmatched half-edges that remain at time ii is of order n. Thus, to 
bound the contribution to F{B) due to some Bcydv), we estimate the probability 
of encountering such a vertex along some £iA^+^ trials. A union bound yields that 
this contribution has order at most 

4A^+^-— < exp (^-iyy^logn -h o[-\/\ogn)j = o(l). 

(c) Coupling vertex degrees: While the tree uses a fixed offspring distribution Z, the 
exploration process in G dynamically changes as the half-edges are sampled without 
replacement. Let denote the number of vertices of degree k in G, and note that 

\Nk - npk\ > 4 (^y/npk logn V log ) = 0 

by a standard large deviation estimate for the binomial variable Nk Bm{n,pk), 
e.g., [18, Theorem 2.1], where the O(-) includes an extra factor of 2 due to the 
conditioning that the sum of the degrees, should be even. We may thus 

condition on the degrees {Du)uev (whence G becomes a random graph with a given 
degree sequence) and assume, by neglecting an event of probability o(l), that 

\Nk — npk\ < A{-dnpk log n V log n) for all fc = 1,..., A (3.15) 

and that Nk = 0 for k > A. Further observe that, by Cauchy-Schwarz, 

A 

y^^iVnpk logn V logn) < y/ An log n V A logn < rdG+oA) _ (3.16) 

k=l 


Letting Nk denote the number of vertices with k unmatched half-edges at some 
point in time, a given half-edge gets matched to a vertex of degree j with probability 
jNj/ 'dZf, kNk- Denoting the last variable by Z, and note that 


Nj-npj 


j 

< A ^ |5j| -k < An exp (^-^y-N/log n^ , 




(3.17) 
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where the inequality between the lines used (3.15)-(3.16), and the last inequality 
used (3.11). Thus, 


\\F{Z e ■)-FiZ € 


E 

i<A 

E.>- 

j<A 


JPj 




kPk 'Zk ^^k 
Pj Efc kNk - Nj Y.k kpk 


Efc kpk Efc kNk 


which, if we write E = k{Nk — npk), is equal to 


Ei 


J<A 


npj - Nj 


+ 


Ni 


^ + 1^1 


i<A 




nEkkPk nJ2kkPkEkkNk 

for large n, using kNk > (1 — o(l))n. By (3.17) and the hypothesis on A, 

\\F{Z e •) - E OIItv < (A + A2)exp (-|7\/logn^ • 

A union bound over I*! A^"*"^ trials of examining i3deg(p) shows that this contribution 
to P(B) is also o(l). 


3.3. Lower bound on mixing time in Theorem 2. Set 

= ^ = (^1 - m 7 \ A ^) , 

and observe that (3.6) implies that dist(p, X^-) < — ( 8 d)“^ 7 T/logn with probability 

at least 1 — e. By Lemma 2.2, applied in a union bound over t <t~ = O(logn), w.h.p. 

max{dist(Xt, ^) : t < t~} < R (3.18) 

for a large enough cq > 0 (depending only on the distribution of Z) in the definition (3.7) 
of R. It follows that upon a successful coupling, is within distance R = O(log log n) 
from the set of vertices corresponding to the vertices exposed in ii rounds of 

our truncated exploration process of the GW-tree. Thus, by (3.11), the distribution of 
given a successful coupling has a mass of 1 — e on at most 

A^ 5 Z|S,| =o(n) 

j<ri 

vertices, yielding the required lower bound. ■ 


3.4. Upper bound on mixing time in Theorem 2. 

3.4.1. The case of minimum degree 3 (Theorem 2, Part (ii)^). Set 

. 

We claim that, if Pq is the transition matrix of the SRW {3Zt), then for large n, 


E 

v&V{G) 


p; 


t+ 


(vi,v) A — exp 
n 


|7V^ 


n 



> 1 


(3.19) 
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To see this, first define the event T = TinT 2 nT 3 , where 


Ti = 

T 2 = 

T3 = 


{^0 + ^ dist(p,Xi+) <l\- j^7v^logn| , 

are successfully coupled | , 

= (loop-erased trace of {Xt)t<t+)i^ ^ ^ ‘S'^} for {Xt) 


on T, 


where is as defined in (3.12). Recall from (3.6) that P(Tf) < e; we established 
in §3.2 that P(T 2 n Ti) < e + o(l) (utilizing the upper bound portion of the event 
Ti); and as for T 3 , the first event in that intersection occurs w.h.p. on the event of 
the lower bound on dist(/ 5 , Xj+) from Ti (otherwise there would be some t < a,i 
which dist(Xt,^) > y/Yo^, which is unlikely as argued above (3.18)), and therefore 
P(T§ n Ti) < 2e + 0 ( 1 ) by the bound above (3.12). Overall, 


P(T") < 4e + 0 ( 1 ), 


and by Markov’s inequality, for sufficiently large n we have 

p(^Pg(T) > 1 - > 1 - V^. 

In particular, with the definition of in mind, with probability at 

Pg i^t+ = ■y , Tf" I =^0 = 'I’l) A iexp (^|7v^logn^ 

veViG) 

> PG(^t+=u,T|So = ui)>PG(T)>l 

v£V{G) 


least 1 — y/^, 


- , 


and (3.19) follows. 

We now reveal G, and can assume that the event in (3.19) occurs. That is, if 
we set fi{v) = Pq {vi,v) A 7 exp (| 7 A/logn) then 1 — < y/^- With the fact 

7r(u) > l/(An) in mind, we infer that 

■) “ ^IItv < V^+ II/tTg “ ^IItv < + W^Pg - ^L^iw) , 

and 

||/r/7r-^(G)||^2(^) <exp(^(| + o(l)) 7 Vlogn) . 

In the setting where pi = p 2 = 0 we can now conclude the proof, using the well-known 
fact that G is then w.h.p. an expander and by Cheeger’s inequality the spectral gap of 
the SRW on G, denoted gap, is uniformly bounded away from 0 (see [I] on expanders 
and, e.g., [10, Lemma 3.5] (treating the kernel of the giant component) for the standard 
argument showing expansion for a random graph with said degree sequence). Indeed, 
if we take 

s = cgap"^ {^y/\ogn + log(l/e)) (3.20) 

for a large enough absolute constant c > 0 , then we will have 

•) - ttIItv < + e , 


as desired. 
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3.4.2. Allowing degrees 1 and 2 (Theorem 2, Part (i)J. The general case requires one 
final ingredient, since G is no longer an expander. Instead of G, consider the graph G' 
on n' < n vertices where every path of degree-2 vertices whose length is larger than 
R = cq log log n is contracted into a single edge, and similarly, every tree whose volume 
is at least R is replaced by a single vertex. 

Let U be the set of vertices in V{G) \ V{G') as well as their neighbors in G. We 
claim that, if u is a uniform vertex conditioned to belong to Ci{G) and 

Bu,t = \J{XkeU}, 

k<t 

where Xj. is SRW from v, then Fy{13m^^2 = o(l). This will reduce the analysis to G', 

as clearly on the event 13(j^ we can couple the walks on G and G' up to time t. Since 

Cl is linear w.h.p., it suffices to take Xq to be uniform over V, i.e., to show that 

= (3.21) 

vev 

The fact that p 2 is uniformly bounded away from 1 implies that 

P(u G U I {Du)uev) < D^(logn)“^° (3.22) 


for every vertex v provided we choose the constant cq to be sufficiently large (for the 
long paths this uses the decay of P(y > R) where Y ~ Geom(p2)j whereas for the 
hanging trees this uses tail estimates for the size of subcritical GW-trees (see [14])). 

Condition on the degree sequence {Du)ueVi let Nk denote the number of vertices 
of degree k in G, and assume (3.15) holds, neglecting an event of probability o(l) as 
described above that inequality. Since Ylk ~ 0{1) thanks to our assumption on EZ, 
it follows (see (3.15)-(3.16)) that = (1 + o{l))nY(ik^Pk = 0{n). In particular, 

for every v £ V we have 7 r{v)~^ = {'^u^u)/Dy = 0{n). Therefore, for every t, 


n 


{Bu,t I {Du)uev) ^ I iDu)uev) 


= E 


^ 7r(u)P, ( U {Xfc G [/} I g) I {Du)uev < t ^ 7r(u)P {v £ U \ {D^Uev) , 


k<t 


where the last step combined a union bound with the stationarity of the SRW. Let 
A = {v ■. > log^n}, and notice that Ylk>\og^ n^Pk — 0(l/log^n) using EZ = 0(1), 

which, by (3.15)-(3.16), yields 7 t{A) = 0(l/log^n). The above display then gives 

- E^ i^u,t I {Du)uev) < tTTiA) + ^ P(i; G G I (Du^v) < ^ , 

where the last inequality used (3.22) to replace each summand in the sum over v £ A^ 
by Dy{logn)~^^ < 1/log^n. This establishes (3.21). 

The proof is concluded by noting that the kernel of G' is an expander w.h.p., and the 
effect of expanding some of its edges into paths of length at most R, or hanging trees of 
volume at most R on some of its vertices, can only decrease its Cheeger constant to c/R 
for some fixed c > 0. Hence, by Cheeger’s inequality, the spectral gap of G' satisfies 
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gap > c/P? for a fixed c > 0 (depending only on the degrees of G'), and thereafter 
taking s as in (3.20) (noting that s x \/log n(log log n)^ now) establishes the following: 
for every e > 0 there exists some 7 > 0 such that, with probability at least 1 — e — o(l), 

(t* - > 1 - e , whereas (t* + < e , 

where Wn = \/log n(log log n)^. This immediately implies cutoff, w.h.p., for any choice 
of a sequence Wn such that w'^ = o{wn)- ■ 

3.5. Random walk on the Erdds-Renyi random graph: Proof of Theorem 1. 

The proof will follow from a modification of the argument used for a random graph on 
a prescribed degree sequence, with the help of the following structure theorem for Ci. 

Theorem 3.3 ([12, Theorem 1]). LetCi be the largest component ofQ{n,p) forp = X/n 
where X > 1 is fixed. Let p, < 1 be the conjugate of X, i.e., = Ae""^. Then C\ is 

contiguous to the following model Ci: 

1. [Kernel] Let A ~ M {X — p,l/n), take Du ~ Poisson(A) for u G [n] i.i.d. and 
condition that Du1d„>3 is even. Let Nk = #{u : Du = k} and N = J2k>3 ^k- 
Draw a uniform multigraph 1C on N vertices with vertices of degree k forfi > 3. 

2. [2-Core] Replace the edges of JC by paths of i.i.d. Geom(l — p) lengths. 

3. [Giant] Attach an independent Poisson{p)-Galton-Watson tree to each vertex. 

That is, P(Ci G Al) —>■ 0 implies P(Ci G Al) —)■ 0 for any set of graphs A. 

The first point is that we develop the neighborhood of a fixed vertex of G (not of Ci) 
as a GW-tree with offspring distribution Z ~ Poisson(A) for A > 1 fixed. Should its 
component turn out to be subcritical, we abort the analysis (with nothing to prove). 

The second point is that, when analyzing the probability of failing to couple the 
SRW on G vs. the GW-tree, mismatched degrees are no longer related to sampling 
with/without replacement from a degree distribution, but rather to the total-variation 
distance between Poisson(A) and Bin(n', X/n) where n' is the number of remaining 
(unvisited) vertices. By (3.11) we know that n' > n(l—exp(—^y-v/logn)), and therefore, 
by the Stein-Chen method (see, e.g., [3, Eq. (1.23)]), this total-variation distance is at 
most X/n -\- ||Poisson(A) — Poisson(An'/n) < (logn)"^*^ (with room to spare). 

The final and most important point involves the modification of G into G', where 
the spectral gap is at least of order R~^ where R = cq log log n for some suitably 
large constant cq > 0. We are entitled to do so, with the exact same estimates on the 
probability that SRW visits the set U of contracted vertices, by Theorem 3.3 (noting the 
traps introduced in Step 2 are i.i.d. with an exponential tail, and similarly for the traps 
from Step 3); the crucial fact that G' has a Gheeger constant of at least c/R follows 
immediately from that theorem using the expansion properties of the kernel. ■ 

3.6. The typical distance from the origin. A consequence of our arguments above 
is the following result on the typical distance of random walk from its origin at f x log n. 

Corollary 3.4. Consider random walk {^t) from a uniform vertex vi in Cj either in 
the setting of Theorem 1 or of Theorem 2, let v denote the speed of random walk on the 
corresponding Galton- Watson tree, and let X be the mean of its offspring distribution. 
For every fixed a>0,ift = a log_)y n then dist(r;i, iZl)/ log;^ n —)■ {va A 1) in probability. 
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Proof. First consider the case a < and fix e > 0 small enough such that au < l — e. 
Let T be a GW-tree with an offspring distribution that has the law of Z (this time 
without any truncation), and consider the coupling described in §3.2 of SRW (Xt) on T 
to SRW {^t) on the subtree F C T explored by the standard breadth-first-search of G 
from vi- That is, we first reveal T and the walk {Xi,... ,Xt), then explore the neigh¬ 
borhood of vi in G, and estimate the probability of Uk<tl3cyc{Xk). By the definition 
of in (1.3), the assumption on a implies that maxfc<t dist(/ 9 , X^,) < (1 — e/2) log;, n 
w.h.p., and therefore, at all times k < t, the boundary of the exploration process of 
G (which is at most the size of T) contains at most half-edges. Recalling the 

argument from §3.2 (part (b)), the probability of hitting a cycle at a given step Xk is 
therefore 0 {An~^^^) < n~^l^ for large enough n, and a union bound over k implies 
that w.h.p. Ufc<ti3cyc(Xfc) does not occur and the coupling is successful. In this case, 
dist(ui, = dist(/ 9 , Xt) and the desired result follows from the definition (1.3) of 
Next, consider a > and suppose first that pi = P 2 = 0. As stated in the 

introduction, it is then the case that the diameter of the graph is (1 -|- o{l))D where 
D = log;, n, and it remains to provide a matching lower bound on dist(ui, ^t)- Let 

R= [(logn)^/'‘J 

and 

h = L(1 - e)log;,nj , io = h - R- 
Consider the exploration tree F, as defined above, up to level ii. 


Claim 3.5. Let H be the auxiliary graph on level Iq of T, where {u,v) G E{H) if 
there is an edge of G connecting Fr(u) and Fi^(u). With high probability, the largest 
connected component of H has size at most 10/e. 

Proof. As stated above, since the boundary of F contains at most half-edges, 

the analysis of §3.2 shows that the probability of Bcjci^) is at most 7i-£/3+o(i) for every 
vertex z in F at distance at most ii from vi. Therefore, the degree of a vertex u m. H 
is dominated by a binomial random variable Q Bin(n^/® , n ^/^), where we used that 
|ri?(u)| < = Consequently, the size of the component of u in FT is dominated 

by the total progeny A of a branching process with offspring random variables (/. By 
exploring this branching process via breadth-first-search (or alternatively, through the 
Dwass-Otter formula; see, e.g., [14]), one finds that 


P(A > F) < Pr^(Ci - 1) > o') < P (^Bin(Fn=/®,n"^/^) > ^) < X] 

^ i=i ^ k>i 




n 


—kejA 


k>£ ' ^ k>i 

Taking i > 10/e supports a union bound over u and completes the proof. 


Corollary 3.6. Let H be as defined in Claim 3.5. With high probability, for every 
connected component G of PI there exists an interval I C of length at least 

eR/lQ so that the induced subgraph of G on levels in I o/|J^gpFR(u) is a forest. 
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Suppose that G satisfies the property of the above corollary. We claim that if 
dist(ui, .^k) = h for some k, then w.h.p. (w.r.t. the walk) 

min dist(ni, ^k+j) > ■ 

l<j<log^ n 

Indeed, in light of the above corollary, in order for the walk to reach level io starting 
from level £i, it must cross an interval of length sR/lO where every vertex has at least 
two children in a lower level. The probability that the biased random walk {St), where 
St — St-i is 1 with probability ^ and —1 with probability |, reaches eR/10 before 
returning to the origin is at most < (logn)“®, and taking a union bound over 

the log^ n possible time steps now completes the treatment of the case pi = P 2 = 0- 
Finally, when we allow degrees 1 and 2, recall from §3.4.2 that one can modify the 
random graph G to a graph G' where the long paths and large hanging trees all have 
size O(loglogn), and couple the SRW on G to the SRW on G' w.h.p. for all t < log^ n. 
The only change in the above argument is in the analysis of the biased random walk {St) 
along the interval of length eii/10 that corresponds to a forest in G. There, the problem 
reduces to an interval shorter by a factor of clog log n, and the fact that 
is again at most (logn)“^ for large enough n concludes the proof. ■ 

4. Non-backtracking random walk on random graphs 

In this section we consider the NBRW on a random graph G with i.i.d. degrees with 
distribution {pi)i>o (see the paragraph preceding Theorem 2) where Po = Pi = 0. Recall 
that the NBRW is a Markov chain on the set E of directed edges, which moves from 
e = (n, v) to a uniformly chosen edge among {e' = {v,w) : w ^ u}. Since deg(u) > 2 (as 
Po = Pi = 0) this Markov chain is well-defined for all t, and that the uniform measure 
vr on FI is stationary. Let PQ{e, e') be the f-step transition probability of the NBRW. 
Our assumption on the degree distribution is that 

Mo := ^ A:(log k)‘^pk < oo . (4.1) 

k 

Let A = Y{,k he the mean degree (noting that 2 < A < oo by (4.1)), and let 

qk-i:=kpk/X {k>l). 

be the shifted size-biased distribution. We further assume that the random variable Z 
given by F{Z = k) = qk satisfies 

F{Z > An) = o{l/n) for A„ := exp [(logn)^'^^“'^] . (4.2) 

Recall that the dimension of harmonic measure for NBRW in the GW-tree with offspring 
distribution {qk) is 

d = ^gfclogfc, (4.3) 

k 

which is finite under (4.1). To state our next result, we let 

= niin{t : {t) < e} where {t) = \\Pg{^^ ') ~ ^Ikv ■ 
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Theorem 4.1 (typical starting point). Let G he a random graph on n vertices with 

1.1. d. degrees with distribution {pk)k>o> where po = pi = 0 and (4.1)-(4.2) hold. For 
every fixed 0 < e < 1, with probability at least 1 — e, 

4k(£) = logn + 0(v/logn). 

Theorem 4.2 (worst starting point). In the setting of Theorem f.l with the extra 
assumption p 2 = 0 , for every fixed 0 < e < 1 , with probability at least 1 — e, 

tuixie) = logn + 0(\/logn). 

Remark 4.3. After completing this work we learned that another proof of Theorem 4-2 
was obtained independently by Ben-Hamou and Salez [4] with a weaker assumption on 
A and an asymptotic estimate of the total-variation distance within the cutoff window. 

4.1. Mixing from a typical starting point. 


4.1.1. The truncated exploration process. We first condition on the degree sequence 
of the random graph, {Dy : u G 14}, and may assume, as in §3.2, that (3.15) holds 
by neglecting an event of probability o(l). Let (!}) denote the NBRW, and for every 
directed edge e = (n, v), let = Dy — 1 (the number of directed edges that the NBRW 
can transition to from e). 

Throughout this proof, let be an odd integer given by 


= 2L + 1 where L = 


(2d) ^logn + 2Rrd ^/^y^log 


n 


(4.4) 


for 


K := 8 \/ Mofs with Mq from (4.1) . 


(4.5) 


Let e = (n, v) G E. Let Tg be the tree of depth L constructed as follows. Explore 
the neighborhood of e via breadth-first-search (as explained in §3.2). Exclude every 
half-edge that is matched to a previously encountered half-edge from Tg. Moreover, do 
not explore an edge e' (truncate its subtree) if it satisfies 


-Deo 

eoSl'fe.e') 


d) >kVl, 


(4.6) 


where 'I'(e, e') denotes the directed path from e to e' in Tg. Let dTg denote the half-edges 
at level L that had been encountered (and not yet matched) in this process. 

Next, consider a second edge f = {x,y) G E such that x,y ^ Tg. Define T} to be the 
tree of depth L formed by exploring Tf* corresponding to the reversed edge f* = {y, x), 
as described above, with the following modification: if a half-edge is matched to dTg, 
we do not include it in the tree. Set 


Tg = a{Tg), Ef = a{Tf), Tgj = a{Tg,Tff. 
Consider the sequence (a^) defined by 

Ok := jk with Mq from (4.1) , 


(4.7) 
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and let £ denote the set of all edges e € E such that the NBRW from e remains in Tg 
except with probability ax, for K the constant in (4.5); that is, define 

£ = {eG E : Qe<aK} where Qe := Fg(^\J {Yt ^ E{Te)} \ Yq = e^. 

t<L 

Analogously, let 

£e = [fGE-. Q,J < ax] where Qej := 1Pg( IJ [Yt i | lo = f) ■ 


t<L 


Note that, by labeling the half-edges of each vertex, we may refer to a directed edge 
e = {u, v) also as e = {i, v) where i G {1,..., D^} is the label of a half-edge matched to 
a half-edge of the vertex u. Similarly, we may refer to / = {x,y) as / = {x,j) where j 
corresponds to a half-edge j G {1,..., Dy} matched to a half-edge of y. 

With this notation, each half-edge in dTe of the form {u',i) for some u' G V and 
z G {1,..., Dy/} corresponds to a unique edge in E, which we include in E{Te), the set 
of directed edges induced on Tg. We claim that, for each I < k < L, 

# |e' G E{Te) : dist(e, e') = A:} < . (4.8) 


Indeed, by condition (4.6), and using its notation as well as (4.4), for every such 
Pg(y^ = e') = n ~ = n-V 2 -o(i) _ 


eoS'I'(e,e') 


e', 

(4.9) 


This implies (4.8) since the events {Y), = e'} are disjoint for all edges eh In particular, 
using that A„ = and L = O(logn), the total number of half-edges encountered in 
the exploration process of Tg is at most ^ and, similarly, this also holds for T^. 

We next estimate the probability that e G £ and f G £e for some fixed e, /. 


Lemma 4.4. For every two edges e = {i,v), f = {x,j) (with 1 <i < Dy, I < j < Dx), 
P(e ^£) <ax , IP’(/ i £e I Te) < ax ■ 


Proof. The argument in §3.2 shows that the NBRW (Y))t>o from Yq = e on the random 
graph G can be coupled to a NBRW {f,t)t>o on a (non-truncated) GW-tree T w.h.p., as 
only the events Bcyc and Bdeg are to be avoided {avoiding a cycle and coupling vertex 
degrees, respectively) when no truncation is performed on the tree. 

On the GW-tree, the random variables (Z)^Jj>i, recording the number of offspring 
of each site visited by the NBRW, are i.i.d. with distribution given by {qk)k>Q, and in 
particular, E[logT)^J = d and E[log^(i4^J] < Mq for Mq from (4.1). Hence, 


max 

1<A:<L 


^(logZ)^,-d) >K^ 


2=1 


< 


R2 


by Kolmogorov’s Maximal Inequality. Hence, the above coupling of (^j) and (Yj), which 
succeeds with probability 1 — o(l), implies that 

nQe] < Mo/K^ + o(l) = (i + 0 {l))a\ . 
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It thus follows from Markov’s inequality that 

P {Qe > ax) < —^ (5 + o{l))aK < ax , 
ax 

provided that n is large enough. The statement concerning the NBRW from f* follows 
in the exact same manner. ■ 

4.1.2. Estimating the distribution of the walk via two truncated trees. We wish to bound 
the quantity PQ{e, f) from below. Note that, since t* = 2L + 1, 

4 *(e,/)> P^(e,eo)l{eo=/o}P’G(/o,/), 

eo^dTe 

where the event {eo = /o} for eo = {uQ,i) G dT^ and /o = ixo,j) G dTJ is equivalent 
to having the i-th half-edge incident to x be matched with the j-th edge incident to y. 
Further note that PQ[{j,xo), {v,v')) = Pq[{v',v), (xo,j)). Hence, 

where Pt{-, •) corresponds to the walk restricted (not conditioned) to remain in E{T). 
We will therefore estimate E[Zej] and then its variance. 

Lemma 4.5. In the setting of Lemma A = {e € £} Ci {f € £/}, which is 

Pf,j-ixieasurahle. Then for every sufficiently large n, 

E[Zej|T-e,/]>m^lA, (4.10) 

\p\ 

Var (Zej I Pej) < rr'^ exp ^/\og . (4.11) 

Proof. Recall that P(eo = /o | Pej) = l/d-^l “ ^-e,/): where the T),,/-measurable 
random variable uig j denotes the number of half-edges matched during the exploration 
process in Pej, which satisfies niej < by (4.8) and the discussion below it. 

Since P^^{e,eo) and Rfe(/,/o) are both Tg,/-measurable, we find that 

E[z.,/1 A/i =—( E pt.Geo))( E ppr^fS) 

1^1-me ,/' 

> ^(1 - ge)(l - Qej) > i(l - 2afc)lA , 

\p\ \p\ 

using the definitions of e G T and / G Tg and that (1 — ax)'^ > 1 — ‘^ax- This 
yields (4.10). 

To establish (4.11), note that by definition, 

Var(Zg,^|.T)< ^ P#^(e, eo)Pi((e, ei)Rfe(r,/*)P#e(r,/^ 

eo,eiedTe foJi&dTj 

■ Cov(l|ejj=jg|, l{ej=j^} I P) . 
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Clearly, if eo 7 ^ ei or /o 7 ^ /i then the term Cov(l{gj,=jg|, l{ei=/i} I equals 
= /o I ^)fP(ei = /i I eo = /o) - lP(ei = /i | -F) 


< 


|£'| - mej Vl-El - me,/- 2 |^|-me,/ 


2 + 0 ( 1 ) 
\E? 


where in the last inequality we used that me,/ = o(n) while \E\ > 2n. On the other 
hand, for the diagonal terms where eo = ei and fo = fi, we have 

Var (l|eo=/o} I -Fe,/) < P (eo = /o I -Fe,/) = ^ ^ 




Overall, using \E\ > 2n, we conclude that 


Var {Ze f \ Eg f) < ^ H- ^ ( max (e, eo)^ ( max Pfe(/*, /q)^ 

V e,/ I e,/; _ ^^3 VeoSaTe 


4n3 ' 2 ?t, \eoedfe 

Now, if eo G dTg, then the criterion (4.6) implies that 


Pm(e,eo) < e 


-Ld+KVL _ 1 


n 


exp 


-' 2-^+0 


V/osaT; + 

iird~^/^y^logn 




n 


for n sufficiently large, and the same holds for Pre(/*,/o). Consequently, 


1 + 0 ( 1 ) 


Var (Zg,/ I Pg,/) < — ^ - exp 

and the result follows. 


2 n 2 


(^-Kd ^^Vlogn^ , 


Remark 4.6. The expectation estimate (4.10) remains valid for all smaller L; it is for 
the variance estimate (4.11) to hold that we must choose L as in (4.4). 

Corollary 4.7. In the setting of Lemma f. 4 , for every sufficiently large n, 

P (■Z'e,/ < (1 - Mk)/\E\ , / G Te I Tg^ l{ee£} ^ ^Xp (^-y/log . 

Proof. Since \E\ = 0{n), we deduce from Lemma 4.5 via Chebyshev’s inequality that 


EeJ < 


1 — HttK 

\E\ 


EeJ 1 a < 


^-Kd-^/'^EEgn 

nHaK/\E\r 


„—Kd l/^+logn 


(4.12) 


Taking expectation given Tg yields the desired result. 


4.1.3. Upper bound on the mixing time in Theorem f.l. Let N = \E\ and e = (z, v) G E. 
Recalling that P^fe, f) > Zgj, it follows that 


E 


^TV 


Ifeea} = E 




< 


He&S} 

N 


E[#{/:/^Te}|Tg] + 


Reeg} 

N 


E 


#{/ 


L{eeg} 


G Tg : Zgj < 


1 — Sax 5 
N / 


+ 3ax 
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The first term in the right-hand is at most aK by Lemma 4.4, and the second is o(l) 
by Corollary 4.7. Therefore, 


E 


4v(t*) I Te Ijeef} < 4o;^ o(l) . 


(4.13) 


Taking expectation over a uniform e € E and using Lemma 4.4 once again, we get 


E 


^TV 


< Bqk + o(l) < e 


by the choices of ax and K. 


4.1.4. Lower bound on the mixing time in Theorem 4-1- Now consider 


L' = 


d ^ log n — 2K'd ^logn , where K' = ^/MqJe . 


Dehne the truncated tree Tg as in §4.1.1 (with K',L' replacing K,L). As in (4.9), 
Ee(Tfc = e') > e“ 
for every 1 < A: < L', and therefore 


> lg(A"-o{i))d-i/2yi^ 

“ n 


# |e' E E{Te) : dist(e, e') = A:} < l<k<L'. 

Since the coupling argument of §3.2 was valid as long as the size of the truncated tree 
did not exceed nexp(—c\/Iog~n) for some fixed c > 0, we can repeat the analysis in the 
proof of Lemma 4.4 to find that 


•(U {F,^E(r.)}|r„ = e)<^ + 


t<L' 


where the probability is over G, (Yt). The fact \E{Te)\ = 


(1) — e -I- o(l), 

o(n) completes the proof. ■ 


4.2. Mixing from a worst starting point with minimum degree 3. We now 

describe how to modify the proof of Theorem 4.1 and derive the proof of Theorem 4.2. 
Since the lower bound applies also to the worst starting point, it remains to adapt the 
proof of the upper bound. 

Recalling (4.13), we have in fact established not only an upper bound on E[dTv^ (A)] 
a uniform starting edge e, but rather from every edge e E T, conditional on Tg. Denote 
by the depth-A: truncated tree from §4.1.1, take and L as defined in (4.4) (so that 
is Tg from that section), take K as in (4.5) and R = 21og2logn. Further denote 
by Tg^’^ the tree obtained by first exposing T^, then exposing Tg^ sequentially for each 
eo E dT^, while (as before) excluding edges that form cycles / loops. 

We will argue that, w.h.p., for every e £ E, the truncated tree satishes 

P§ie,Se)>l-2aK, (4.14) 

where (a^) is as defined in (4.7) (so that ax e), and 

Se = {eo E : Qe, < ax} for := Eg( IJ ^ I = eo) ■ 

t<L 
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(That is, the NBRW started at cq does not exit , which, as opposed to the original 
definition of S, also precludes it from visiting the subtree for another ei G dT^). 
The proof of (4.13) also yields that 


E 


^TV 


nR:L 


l{eoe£e} ^ + o(l). 


Hence, the upper bound will follow from (4.14) using the decomposition 


Ph*^^{e,-)= E PGie,eo)Pljieo,-)- 

eoGdTR 


To prove (4.14), note that the proof of Lemma 4.4 further implies that 

F{ej+i E Se I {Tei}i=i) > (1 - for all ei,..., ej+i E dT^ . 

The indicators of the events {ej+i E Ui<j Pej} (j = 1; 2,..., IclTg^l) are stochastically 
dominated by i.i.d. Bernoulli random variables with parameter 
Thus, appealing to Hoeffding-Azuma for the random variable 


5e:= P^(e,e,)l^g^.,^-^j=Eg(e,4) 

Cj&dTR 

(while noting that PQ{e, eo) < 2“^ < log“^ n by the definition of R and the assumption 
on the minimum degree assumption), we get 

IF’(*S'e < 1 — 2aj^) < exp (—log^ n) , 

which, through a union bound on e, establishes (4.14) and concludes the proof. ■ 
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